Open-world probabilistic databases: Semantics, algorithms, complexity
نویسندگان
چکیده
Large-scale probabilistic knowledge bases are becoming increasingly important in academia and industry. They continuously extended with new data, powered by modern information extraction tools that associate probabilities base facts. The state of the art to store process such data is founded on databases. Many systems based databases, however, still have certain semantic deficiencies, which limit their potential applications. We revisit semantics argue closed-world assumption i.e., facts not appearing database probability zero, conflicts everyday use large-scale bases. To address this discrepancy, we propose open-world as a model. In model, unknown facts, also called open can be assigned any value from default interval. Our analysis entails our model aligns better many real-world tasks query answering, relational learning, completion, rule mining. make various technical contributions. show complexity dichotomy, between polynomial time , for evaluating unions conjunctive queries databases lifted This result supported an algorithm computes so-called safe efficiently. Based algorithm, prove linear under reasonable assumptions. remains true more restricted class queries. extend beyond queries, obtain host results both classical conclude in-depth investigation combined respective models.
منابع مشابه
Open-World Probabilistic Databases
Large-scale probabilistic knowledge bases are becoming increasingly important in academia and industry alike. They are constantly extended with new data, powered by modern information extraction tools that associate probabilities with database tuples. In this paper, we revisit the semantics underlying such systems. In particular, the closed-world assumption of probabilistic databases, that fact...
متن کاملOpen-World Probabilistic Databases: An Abridged Report
Large-scale probabilistic knowledge bases are becoming increasingly important in academia and industry alike. They are constantly extended with new data, powered by modern information extraction tools that associate probabilities with database tuples. In this paper, we revisit the semantics underlying such systems. In particular, the closed-world assumption of probabilistic databases, that fact...
متن کاملOpen World Probabilistic Databases (Extended Abstract)
Introduction and Motivation Driven by the need to learn from vast amounts of text data, efforts throughout natural language processing, information extraction, databases. and AI are coming together to build large-scale knowledge bases. Academic systems such as NELL [14], Reverb [7], Yago [11], and DeepDive [16] continuously crawl the web to extract relational information. Industry projects such...
متن کاملA Possible World Semantics for Disjunctive Databases
We investigate the fundamental problem of when a ground atom in a disjunctive database is assumed false. There are basically two diierent approaches for inferring negative information for disjunctive databases; they are Minker's Generalized Closed World Assumption (GCWA) and Ross and Topor's Disjunctive Database Rule (DDR). A problem with the GCWA is that disjunctive clauses are sometimes inter...
متن کاملProbabilistic Algorithms and Complexity Classes
The main theme in this dissertation is a computation on a probabilistic Turing machine (PTM), which is a Turing machine with distinguished states called coin-tossing states. In general, it is assumed that a PTM has a fair coin. This is led by that any xed biased coin can simulate another xed biased coin in a constant expected time. We rst consider the computation which will always terminates in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial Intelligence
سال: 2021
ISSN: ['2633-1403']
DOI: https://doi.org/10.1016/j.artint.2021.103474